Explanation: Chroma & Timbre Analyse
Chroma-based Self-Similarity: In this Self-Similarity Matrix, we identify gridlike patterns in the track, suggesting repeating harmonic structures, especially in the area from 50s-100s, as well as near the end. Overall the track we see repeated patterns, suggesting recurring motifs.
Timbre (MFCC) Self-Similarity: Here we see bright edges, implying that the intro and outro are similar. In the middle, we get segments where we have darker regions, especially from 90s-140s, meaning variations in timbre. We see patterns that reoccur meaning that we have repeating timbral elements throughout the track.
Chromagram: In the Chromagram, we observe that the A note has a prominent frequency, especially from 55s and on. We also observe that between 00s-55s and 110-150s we have a structure with a presence of different pitches, meaning we have a varied harmonic structure.
Cepstrogram:: We observe a quite stable cepstrogram throughout the track, meaning that there are no drastical changes throughout the track.
In this project, I analyze my tracks ‘desmond-l-1’ and ‘desmond-l-2’ compared to the Computational Musicology 2025 corpus using Essentia features such as danceability, energy, valence, and instrumentalness.
I curated my tracks using generative AI tool Stable Audio. The prompts I used were based on two of my favorite artists: KI/KI and Marlon Hoffstadt. Both DJs are known for their high energetic music, combining acid and trance with techno.
The table below presents the extracted Essentia feature values for my generated tracks.| filename | approachability | arousal | danceability | engagingness | instrumentalness | tempo | valence |
|---|---|---|---|---|---|---|---|
| desmond-l-1 | 0.4008293 | 5.525241 | 0.6376823 | 0.8862919 | 0.7578623 | 93 | 5.837563 |
| desmond-l-2 | 0.3221814 | 6.353199 | 0.9561019 | 1.0029999 | 0.5750142 | 100 | 5.886079 |
This graph explores the relationship between arousal and valence, with the danceability represented by the color.
My tracks, desmond-l-1, and desmond-l-2 score high in arousal and valence, meaning they are energetic and also emotionally positive. This aligns perfect with the sound of KI/KI and Marlon Hoffstadt.
Compared to the class corpus, my tracks exhibit a higher combination of energy, emotional positivity, and danceability. This suggests that they are: